Serial No.: 09/692,995 
Art Unit: 2623 

Please amend the present application as follows: 
In the Claims 

The following is a copy of Applicants' claims that identifies language being added with 
underlining (" ") and language being deleted with strikethrough (" — "), as is applicable: 

1-79. (Canceled) 

80. (Previously presented) A method implemented by a television set-top terminal 
(STT) coupled via a bi-directional communication network to a server located remotely from the 
STT in a cable television headend, said method comprising steps of: 

receiving via a tuner in the STT a video presentation provided by the server located in the 
cable television headend, wherein the video presentation is a video-on-demand 
presentation; 

outputting by the STT at least a portion of the video presentation as a video-on-demand 
television signal; 

receiving a first user input associated with bookmarking a visual scene contained in the 
video presentation, including receiving a character sequence to be assigned to the 
visual scene while the video presentation is being presented to the user; 

storing information related to the visual scene in a memory of the STT responsive to 
receiving the first user input, including storing only in the memory of the STT 
information related to the visual scene in response to receiving the first user input, 
including storing only in the memory of the STT data corresponding to the 
character sequence in response to receiving the user input configured to assign the 
character sequence to the visual scene; 
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outputting by the STT at least another portion of the video presentation as a video-on- 
demand television signal; 

receiving a second user input configured to request from the headend the visual scene in 
the video presentation after the STT has output the at least another portion of the 
video presentation; 

responsive to receiving the second user input, requesting by the STT that the headend 
send the video presentation beginning from the requested visual scene; 

receiving by the STT from the headend the video presentation beginning from the 
requested visual scene; and 

outputting by the STT a video-on-demand television signal comprising a portion of the 
video presentation starting from a location corresponding to the visual scene 
responsive to the second user input, wherein the location corresponding to the 
visual scene is identified by the STT using the information related to the visual 
scene, including using information related to the visual scene stored only in the 
STT. 

81. (Canceled) 

82. (Previously presented) The method of claim 80, further comprising receiving a 
plurality of user inputs configured to assign a plurality of respective character sequences 
corresponding to a plurality of respective visual scenes that were bookmarked responsive to a 
plurality of respective user inputs. 
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83. (Previously presented) The method of claim 80, further comprising the step of: 
receiving a user input configured to request information related to the visual scene in the 

video presentation; and 
providing the requested information responsive to receiving the user input configured to 
request information. 

84. (Canceled) 

85. (Previously presented) The method of claim 80, further comprising outputting 
information confirming that the visual scene has been bookmarked, wherein the information 
overlays a minority portion of a television screen being used to display the video presentation. 

86. (Previously presented) The method of claim 85, wherein the information 
confirming that the visual scene has been bookmarked includes at least one of a banner and an 
icon. 

87 - 89. (Canceled) 

90. (Previously presented) The method of claim 80, wherein the visual scene is 
associated with a bookmark list associated with a plurality of visual scenes associated with a 
plurality of respective user inputs. 
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91. (Previously presented) The method of claim 80, further comprising associating a 
plurality of visual scenes with a plurality of respective bookmark lists associated with a plurality 
of respective users responsive to a plurality of respective user inputs. 

92. (Previously presented) The method of claim 80, further comprising associating a 
plurality of visual scenes with a plurality of respective bookmark lists associated with a plurality 
of respective video presentations responsive to a plurality of respective user inputs. 

93. (Previously presented) The method of claim 80, further comprising: 

after expiration of a rental access period corresponding to the video presentation, 
prompting the user to provide input indicating whether the information is to be 
deleted from the memory of the STT. 

94. (Previously presented) The method of claim 80, further comprising: 

storing an image corresponding to the visual scene in a memory of the STT responsive to 
receiving the first user input. 

95. (Previously presented) The method of claim 80, wherein the second user input 
requesting the visual scene corresponds to a thumbnail image corresponding to the visual scene, 
the thumbnail image being simultaneously provided with a plurality of thumbnail images 
corresponding to a plurality of visual scenes in the video presentation. 
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96. (Currently amended) A television set-top terminal (STT) coupled via a bi- 
directional communication network to a server located remotely from the STT in a cable 
television headend, said STT comprising: 

a tuner configured to receive a motion video presentation provided by the server located 
in the cable television headend, wherein the video presentation is a video-on- 
demand presentation; 
a memory; and 

a processor that is programmed to enable the STT to, 

output at least a portion of the motion video presentation as a video-on-demand 
television signal, 

store information related to a visual scene contained in the motion video 
presentation only ef in the memory of the STT responsive to the STT 
receiving a first user input associated with the visual scene, without 
stopping output of the motion video presentation, wherein the first user 
input includes a character sequence to be assigned to the visual scene, and 
wherein the information related to the visual scene includes data 
corresponding to the character sequence, 

output at least another portion of the motion video presentation as a video-on- 
demand television signal, 

receive a second user input configured to request from the headend the visual 
scene in the video presentation after the STT has output the at least 
another portion of the motion video presentation, 

responsive to receiving the second user input at the STT, request that the headend 
send the motion video presentation beginning from the requested visual 
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scene, 

receive from the headend the motion video presentation beginning from the 
requested visual scene, and 

output responsive to the STT receiving a second user input a video-on-demand 
television signal comprising a portion of the motion video presentation 
starting from a location corresponding to the visual scene, including using 
information related to the visual scene stored only the memory of the STT, 
wherein the video-on-demand television signal comprising the portion of the motion 

video presentation starting from a location corresponding to the visual scene is 

output after the at least another portion of the motion video presentation is output 

as a video-on-demand television signal. 

97. (Previously presented) The STT of claim 96, wherein the visual scene is 
associated with a bookmark list associated with a plurality of visual scenes corresponding to a 
plurality of respective user inputs. 

98. (Previously presented) The STT of claim 96, wherein the processor is 
programmed to associate a plurality of visual scenes with a plurality of respective bookmark lists 
associated with a plurality of respective users responsive to a plurality of respective user inputs. 

99. (Previously presented) The STT of claim 96, wherein the processor is 
programmed to associate a plurality of visual scenes with a plurality of respective bookmark lists 
associated with a plurality of respective motion video presentations responsive to a plurality of 
respective user inputs. 
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100. (Previously presented) The STT of claim 96, wherein the processor is configured 
to prompt the user to provide input indicating whether the data is to be deleted from the memory 
of the STT. 

101. (Previously presented) The STT of claim 96, wherein the processor is configured 
to enable the STT to store in the memory an image corresponding to the visual scene responsive 
to receiving the first user input. 

102-122. (Canceled) 



